10 research outputs found

    Search Rank Fraud De-Anonymization in Online Systems

    Full text link
    We introduce the fraud de-anonymization problem, which goes beyond fraud detection to unmask the human masterminds responsible for posting search rank fraud in online systems. We collect and study search rank fraud data from Upwork, and survey the capabilities and behaviors of 58 search rank fraudsters recruited from 6 crowdsourcing sites. We propose Dolos, a fraud de-anonymization system that leverages traits and behaviors extracted from these studies to attribute detected fraud to crowdsourcing-site fraudsters, and thus to real identities and bank accounts. We introduce MCDense, a min-cut dense component detection algorithm, to uncover groups of user accounts controlled by different fraudsters, and we leverage stylometry and deep learning to attribute them to crowdsourcing-site profiles. Dolos correctly identified the owners of 95% of fraudster-controlled communities, and uncovered fraudsters who promoted as many as 97.5% of the fraud apps we collected from Google Play. When evaluated on 13,087 apps (820,760 reviews), which we monitored for more than 6 months, Dolos identified 1,056 apps with suspicious reviewer groups. We report orthogonal evidence of their fraud, including fraud duplicates and fraud re-posts. Comment: The 29th ACM Conference on Hypertext and Social Media, July 201
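    The MCDense step can be sketched as a recursive minimum-cut search: split the reviewer co-activity graph along its global min-cut until the surviving components are dense enough to flag as a candidate fraudster-controlled group. Everything below is an illustrative sketch, not the paper's exact algorithm: the 0.7 density threshold, the plain edge-density stopping rule, and the dict-of-dicts graph encoding are all assumptions, and the global min-cut is computed with a hand-rolled Stoer-Wagner routine.

```python
def min_cut(graph):
    """Stoer-Wagner global minimum cut on an undirected weighted graph
    encoded as {u: {v: weight}}. Returns (cut_weight, one_side_nodes)."""
    g = {u: dict(nbrs) for u, nbrs in graph.items()}
    merged = {u: {u} for u in g}          # supernode -> absorbed originals
    best = (float("inf"), set())
    while len(g) > 1:
        a = next(iter(g))                 # arbitrary start node for this phase
        weights = {u: g[a].get(u, 0) for u in g if u != a}
        s = t = a
        cut_of_phase = 0.0
        while weights:                    # maximum-adjacency ordering
            s, t = t, max(weights, key=weights.get)
            cut_of_phase = weights.pop(t)
            for v, w in g[t].items():
                if v in weights:
                    weights[v] += w
        if cut_of_phase < best[0]:        # cut-of-the-phase isolates t's group
            best = (cut_of_phase, set(merged[t]))
        for v, w in g[t].items():         # contract t into s
            if v != s:
                g[s][v] = g[s].get(v, 0) + w
                g[v][s] = g[s][v]
                del g[v][t]
        g[s].pop(t, None)
        del g[t]
        merged[s] |= merged.pop(t)
    return best

def subgraph(graph, nodes):
    return {u: {v: w for v, w in graph[u].items() if v in nodes}
            for u in nodes}

def components(graph):
    """Connected components via iterative DFS."""
    seen, comps = set(), []
    for start in graph:
        if start in seen:
            continue
        comp, stack = set(), [start]
        while stack:
            u = stack.pop()
            if u not in comp:
                comp.add(u)
                stack.extend(graph[u])
        seen |= comp
        comps.append(comp)
    return comps

def density(graph):
    n = len(graph)
    if n < 2:
        return 0.0
    edges = sum(len(nbrs) for nbrs in graph.values()) / 2
    return edges / (n * (n - 1) / 2)

def mcdense(graph, threshold=0.7):
    """Recursively split by global min-cut until components are dense
    (illustrative threshold). Singleton leftovers are discarded."""
    if len(graph) < 2:
        return []
    comps = components(graph)
    if len(comps) > 1:                    # split disconnected graphs directly
        return [grp for c in comps
                for grp in mcdense(subgraph(graph, c), threshold)]
    if density(graph) >= threshold:
        return [set(graph)]               # dense enough: report this group
    _, side = min_cut(graph)
    rest = set(graph) - side
    return (mcdense(subgraph(graph, side), threshold)
            + mcdense(subgraph(graph, rest), threshold))
```

    On a graph made of two 4-cliques joined by a single bridge edge, the min-cut severs the bridge and each clique is reported as its own dense community.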

    Automated Crowdturfing Attacks and Defenses in Online Review Systems

    Full text link
    Malicious crowdsourcing forums are gaining traction as sources of online misinformation, but are limited by the costs of hiring and managing human workers. In this paper, we identify a new class of attacks that leverage deep learning language models (Recurrent Neural Networks, or RNNs) to automate the generation of fake online reviews for products and services. Not only are these attacks cheap and therefore more scalable, but they can control the rate of content output to eliminate the signature burstiness that makes crowdsourced campaigns easy to detect. Using Yelp reviews as an example platform, we show how a two-phase review generation and customization attack can produce reviews that are indistinguishable from real ones by state-of-the-art statistical detectors. We conduct a survey-based user study to show that these reviews not only evade human detection, but also score high on "usefulness" metrics assigned by users. Finally, we develop novel automated defenses against these attacks by leveraging the lossy transformation introduced by the RNN training and generation cycle. We consider countermeasures against our mechanisms, show that they produce unattractive cost-benefit tradeoffs for attackers, and show that they can be further curtailed by simple constraints imposed by online service providers.
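    The two-phase generate-then-customize pipeline can be illustrated with a dependency-free toy: a character n-gram table stands in for the paper's RNN language model (phase one), and a keyword-substitution pass adapts the generic text to a specific target (phase two). The model, the substitution scheme, and all names below are simplifications for illustration, not the attack's actual components.

```python
import random
from collections import defaultdict

def train_char_model(corpus, order=2):
    """Character n-gram table: maps each length-`order` context to the
    characters observed after it (toy stand-in for the RNN)."""
    model = defaultdict(list)
    for i in range(len(corpus) - order):
        model[corpus[i:i + order]].append(corpus[i + order])
    return model

def generate(model, seed, length, order=2, rng=None):
    """Phase 1: sample characters from the learned distribution."""
    rng = rng or random.Random(0)
    out = seed
    for _ in range(length):
        nxt = model.get(out[-order:])
        if not nxt:                 # dead end: no continuation observed
            break
        out += rng.choice(nxt)
    return out

def customize(review, substitutions):
    """Phase 2: swap generic words for target-specific keywords
    (the paper's customization step is richer; this is a sketch)."""
    for generic, specific in substitutions.items():
        review = review.replace(generic, specific)
    return review
```

    Because generation is decoupled from human labor, an attacker can pace calls to `generate` arbitrarily, which is exactly the burstiness-hiding property the abstract highlights.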

    Exploiting Burstiness in Reviews for Review Spammer Detection

    No full text
    Online product reviews have become an important source of user opinions. For profit or fame, imposters have been writing deceptive or fake reviews to promote and/or demote target products or services. Such imposters are called review spammers. In the past few years, several approaches have been proposed to deal with the problem. In this work, we take a different approach, which exploits the bursty nature of reviews to identify review spammers. Bursts of reviews can be due either to the sudden popularity of products or to spam attacks. Reviewers and reviews appearing in a burst are often related, in the sense that spammers tend to work with other spammers and genuine reviewers tend to appear together with other genuine reviewers. This paves the way for us to build a network of reviewers appearing in different bursts. We then model reviewers and their co-occurrence in bursts as a Markov Random Field (MRF), and employ the Loopy Belief Propagation (LBP) method to infer whether each reviewer in the graph is a spammer or not. We also propose several features and employ feature-induced message passing in the LBP framework for network inference. We further propose a novel evaluation method that automatically evaluates the detected spammers using supervised classification of their reviews. Additionally, we employ domain experts to perform a human evaluation of the identified spammers and non-spammers. Both the classification results and the human evaluation show that the proposed method outperforms strong baselines, which demonstrates its effectiveness.
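    The MRF inference step can be sketched with a minimal loopy belief propagation loop over binary states (0 = genuine, 1 = spammer). This bare-bones version omits the paper's feature-induced message passing: the priors, the compatibility matrix, and the fixed iteration count are all illustrative assumptions.

```python
def loopy_bp(edges, priors, compat, iters=25):
    """Minimal loopy belief propagation on a pairwise MRF.
    edges:  (i, j) pairs for reviewers co-occurring in a burst
    priors: {reviewer: [P(genuine), P(spammer)]} from behavioral cues
    compat: 2x2 matrix rewarding neighbors that share a label"""
    nbrs = {n: set() for n in priors}
    for i, j in edges:
        nbrs[i].add(j)
        nbrs[j].add(i)
    # messages m[(i, j)][x_j], initialized uniform
    msgs = {(i, j): [1.0, 1.0] for i in priors for j in nbrs[i]}
    for _ in range(iters):
        new = {}
        for i, j in msgs:
            m = [0.0, 0.0]
            for xj in (0, 1):
                for xi in (0, 1):
                    p = priors[i][xi] * compat[xi][xj]
                    for k in nbrs[i] - {j}:      # all neighbors except j
                        p *= msgs[(k, i)][xi]
                    m[xj] += p
            z = m[0] + m[1]
            new[(i, j)] = [m[0] / z, m[1] / z]   # normalize for stability
        msgs = new
    beliefs = {}
    for n in priors:
        b = [priors[n][x] for x in (0, 1)]
        for k in nbrs[n]:
            b = [b[x] * msgs[(k, n)][x] for x in (0, 1)]
        z = b[0] + b[1]
        beliefs[n] = [b[0] / z, b[1] / z]
    return beliefs
```

    With a homophilous compatibility matrix, a reviewer with a strong spammer prior pulls the neutral reviewers it co-bursts with toward the spammer label, which is the propagation effect the abstract relies on.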

    DeepTrust: An Automatic Framework to Detect Trustworthy Users in Opinion-Based Systems

    No full text
    Opinion spamming has recently gained attention as more and more online platforms rely on users' opinions to help potential customers make informed decisions about products and services. Yet, while work on opinion spamming abounds, most efforts have focused on detecting whether an individual reviewer is a spammer or fraudulent. We argue that this is no longer sufficient, as reviewers may contribute to an opinion-based system in various ways, and their input can range from highly informative to noisy or even malicious. In an effort to improve the detection of trustworthy individuals within opinion-based systems, in this paper we develop a supervised approach to differentiate among different types of reviewers. In particular, we model the problem of detecting trustworthy reviewers as a multi-class classification problem, wherein users may be fraudulent, unreliable or uninformative, or trustworthy. We note that expanding beyond the classic binary classification of trustworthy versus untrustworthy (or malicious) reviewers is an interesting and challenging problem. Some untrustworthy reviewers may behave similarly to reliable reviewers, yet be driven by malicious motives. Conversely, other untrustworthy reviewers may not be malicious, but rather lazy or unable to contribute to the common knowledge of the reviewed item. Our proposed method, DeepTrust, relies on a deep recurrent neural network that produces embeddings aggregating temporal information: we consider users' behavior over time, as they review multiple products. We model the interactions of reviewers and the products they review using a temporal bipartite graph, and consider the context of each rating by including other reviewers' ratings of the same items. We carry out extensive experiments on a real-world dataset of Amazon reviewers with known ground truth about spammers and fraudulent reviews. Our results show that DeepTrust can detect trustworthy, uninformative, and fraudulent users with an F1-measure of 0.93. We also drastically improve the detection of fraudulent reviewers (AUROC of 0.97 and average precision of 0.99 when combining DeepTrust with the F&G algorithm) compared to the state-of-the-art REV2 method (AUROC of 0.79 and average precision of 0.48). Further, DeepTrust is robust to cold-start users and outperforms all existing baselines.
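    The input construction over the temporal bipartite graph can be sketched as follows: each of a reviewer's ratings becomes a sequence step carrying the rating itself plus its deviation from the other reviewers' mean rating of the same item (the per-rating context the abstract mentions). This only illustrates the input side, not DeepTrust's recurrent network; the specific (rating, deviation) feature pair and all field names are assumptions.

```python
from collections import defaultdict

def build_sequences(events):
    """Build per-reviewer temporal feature sequences from a review
    stream. events: iterable of (timestamp, reviewer, item, rating)."""
    by_item = defaultdict(list)
    for _, reviewer, item, rating in events:
        by_item[item].append((reviewer, rating))
    sequences = defaultdict(list)
    for _, reviewer, item, rating in sorted(events):  # time order
        peers = [r for other, r in by_item[item] if other != reviewer]
        # with no peers, fall back to the rating itself (zero deviation)
        consensus = sum(peers) / len(peers) if peers else rating
        sequences[reviewer].append((rating, rating - consensus))
    return dict(sequences)
```

    The resulting per-user sequences are exactly the kind of variable-length temporal input a recurrent network would embed; a reviewer who consistently deviates from peer consensus stands out in the second feature.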

    The Wisdom of the Gaming Crowd

    No full text
    In this paper, we report on three projects in which we apply natural language processing techniques to analyse video game reviews. We present our process, techniques, and progress in extracting and analysing player reviews from the gaming platform Steam. Analysing video game reviews presents a great opportunity to assist players in choosing games to buy, to help developers improve their games, and to aid researchers in further understanding player experience in video games. With limited previous research specifically focused on game reviews, we aim to provide a baseline for future research to tackle some of the key challenges. Our work shows promise for using natural language processing techniques to automatically identify features, sentiment, and spam in video game reviews on the Steam platform.
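    The kind of feature and sentiment extraction described can be illustrated with a toy lexicon pass over one review. The word lists and the count-difference scoring rule below are placeholders; the projects described use fuller NLP pipelines.

```python
import re
from collections import Counter

# Illustrative lexicons only, not the projects' actual vocabularies.
POSITIVE = {"fun", "great", "addictive", "polished"}
NEGATIVE = {"buggy", "boring", "broken", "grindy"}
FEATURES = {"graphics", "story", "combat", "multiplayer"}

def analyse_review(text):
    """Extract mentioned game features and a crude sentiment score
    (positive minus negative word counts) from one Steam review."""
    counts = Counter(re.findall(r"[a-z']+", text.lower()))
    sentiment = (sum(counts[w] for w in POSITIVE)
                 - sum(counts[w] for w in NEGATIVE))
    features = sorted(f for f in FEATURES if counts[f])
    return {"sentiment": sentiment, "features": features}
```

    A mixed review mentioning a praised story and buggy combat yields a neutral score with both features flagged, which is the per-aspect signal useful to players and developers alike.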